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Modal interpretations constitute a particular approach to associating dynamical variables with 
physical systems in quantum mechanics. Given the 'quantum logical' constraints that are typically 
adopted by such interpretations, only certain sets of variables can be taken to be simultaneously 
definite- valued, and only certain sets of values can be ascribed to these variables at a given time. 
Moreover, each allowable set of variables and values can be uniquely specified by a single 'preferred' 
projector in the Hilbert space associated with the system. In general, the preferred projector can 
be one of several possibilities at a given time. In previous modal interpretations, the different pos- 
sible preferred projectors have formed an orthogonal set. This paper investigates the consequences 
of adopting a non- orthogonal set. We present three contributions on this issue: (1) we provide 
an argument for such non-orthogonality, based on the assumption that perfectly predictable mea- 
surements reveal pre-existing values of variables, an assumption which has traditionally constituted 
a strong motivation for the modal approach; (2) we generalize the existing framework for modal 
interpretations to accommodate non-orthogonal preferred projectors; (3) we present a novel type 
of modal interpretation wherein the set of preferred projectors is fixed by a principle of entropy 
minimization, and we discuss some of the successes and shortcomings of this proposal. 
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I. INTRODUCTION 

> 

In operational quantum mechanics, theoretical predictions take the form 'if such-and-such a measurement is made 
after such-and-such a preparation, such-and-such an outcome will be found with such-and-such a probability'. In 
contrast, a realist interpretation is an attempt to understand quantum mechanics as making stronger claims of the 
form 'such-and-such a variable has such-and-such a value with such-and-such a probability'. The 'elements of reality' 
of Einstein, Podolsky and Rosen |Q, and Bell's 'be-ables' are two ways of referring to the variables that possess 
definite values in a realist interpretation. We will simply refer to them as the determinate variables. Since the set of 
determinate variables in some sense specifies 'what exists', we call it the ontology for the system. The specification 
of the values of the determinate variables will be called the value ascription to the ontology. Within this approach, 
one assigns a property to a system by assigning a value to a determinate variable. Since the ontology and the value 
ascription together constitute a complete specification of the properties of a quantum system, they will jointly be 
' referred to as the property ascription. 

Within 'orthodox' interpretations of this type, a variable is determinate if and only if it is associated with an 
operator for which the state vector is an eigenstate, and its value is the corresponding eigenvalue. It is also assumed 
that a variable V defined on a subsystem is determinate only if V I is determinate on the total system, where / is 
the identity operator for the part of the total system that is not included in the subsystem. According to these rules, 
the ontologies and the value ascriptions for all systems are uniquely defined by the state vector. It is widely recognized 
■ that this view, together with the assumption that the evolution of the state vector is unitary for all time, leads to the 
quantum measurement problem, namely, the failure to ensure the determinateness of macroscopic variables such as 
the pointer reading of an apparatus 

One approach to the problem is to introduce a non-unitary dynamics for the state vector into the formalism of the 
theory (the 'collapse' of the state vector). A different approach is to preserve the unitary dynamics, but to reject the 
notion that a variable is determinate only if it has the state vector as an eigenstate of the associated operator. In 
the latter type of approach the property ascription need not be fixed at a given time by the state vector. Rather, 
it may be that the state vector describes only the set of possible property ascriptions, in which case it describes 
what is possible and what, if anything, is necessary. Since the logic of possibility and necessity is modal logic, realist 
no-collapse interpretations of this type have been called modal interpretations of quantum mechanics Q . 

Modal interpretations typically impose many constraints on the form of the property ascription for a system. Given 
these constraints, there is always a unique 'most elementary' possessed property defined by a property ascription. 
We call this the preferred property for that property ascription. Since at a given time the property ascription may be 
one of several possibilities, each of which define a different preferred property, there is in general a set of preferred 
properties associated with a system. An example may serve to clarify these concepts. Suppose the system is a digital 
display on an apparatus. The property ascription for the display may include such properties as 'the digital display 
shows a number between 1 and 3', 'the digital display shows a number smaller than 5', etc., while the preferred 
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property may be 'the digital display shows the number 2'. The set of preferred properties may consist of a list of 
properties each of the form 'the digital display shows the number k\ but differing in the value of k. In previous modal 
interpretations, the preferred properties have been associated with orthogonal projectors. 

There are three contributions made in this paper. First, we demonstrate that any modal interpretation which 
adopts the standard constraints upon the property ascription and which seeks to satisfy a particular criterion of 
faithful measurement must allow for the set of preferred properties to be associated with non-orthogonal projectors. 
Second, we introduce a framework for modal interpretations that incorporates such preferred properties. Third, we 
present a novel proposal within this framework wherein the preferred properties are fixed by a principle of entropy 
minimization. 

The paper is organized as follows. In section 2, we present a review of the constraints upon the property ascription 
that are standard among modal interpretations, and we provide a rigorous definition of the notion of a preferred 
property. In section 3, we present the argument that the preferred properties must be associated with non-orthogonal 
projectors if one hopes to explain the outcomes of perfectly predictable measurements in terms of pre-existing proper- 
ties of the system under investigation, that is, if perfectly predictable measurements are to be faithful. The argument 
relies on a particular kind of experiment, involving a sequence of two measurements which have the following critical 
features: (1) the first measurement disturbs the state of the system differently for different outcomes, resulting in the 
preparation of non-orthogonal states; and (2) the variable measured by the second device depends on the outcome of 
the first measurement in such a way that the outcome of the second measurement is always perfectly predictable. 

In order to accomodate non-orthogonal preferred properties, we require a new framework for modal interpretations, 
which is the subject of section 4. We preserve most of the standard constraints on the property ascription, in 
particular constraints involving the functional relations between the values of variables. However, we show that one 
must abandon the assumption that different property ascriptions share a common ontology. We assume Healey's so- 
called 'weakening condition' S, and adopt Clifton's rule [|| for relating the properties of composites to the properties 
of the subsystems of which they are composed. Moreover, we follow previous authors [Q (|] in requiring that the 
dynamics of the property ascription be Markovian and satisfy certain constraints of analyticity, while also reproducing 
the standard quantum statistics. Guided by these constraints, we introduce a framework for modal interpretations 
that incorporates non-orthogonal preferred projectors. This constitutes a generalization of the framework introduced 
by Bub and Clifton JT§. 

With this framework in hand, we proceed in section 5 to present a novel proposal for a modal interpretation. We 
begin by assuming that there is a distinguished division of the universe into elementary subsystems, or equivalently a 
distinguished factorization of the total Hilbert space. A preferred decomposition of the state vector is singled out by 
the minimization of a particular entropic quantity that quantifies the degree of entanglement of the state vector with 
respect to the distinguished factorization. The actual property ascription is assumed to be fixed by a single clement 
of this decomposition, whose identity evolves by a stochastic process with specified statistical properties. Within 
this proposal, we demonstrate that the quantum measurement problem is avoided for several models of measurement 
interactions, and a large class of perfectly predictable measurements are shown to reveal pre-existing properties. In 
section 6, we present our concluding remarks. 

II. THE MODAL APPROACH 
A. Review of Constraints 

We begin by considering the notion of a property of a physical system. The type of properties in which we are 
interested are those of the form 'having a value of the variable V in the range A'. For every such property, one can 
associate an idempotent variable that has value 1 if the value of V is in the range A and if the value lies outside 
this range. Whether a property is possessed or not is given by the value of this idempotent variable; it is possessed if 
the value is 1, and it is not possessed if the value is 0. 

In classical mechanics, a variable is represented by a function on phase space, and the possible values of this variable 
are just the values in the range of this function. The property of 'having a value of the variable V in the range A' 
is associated with the subset of phase space containing all points for which V is in the range A. For instance, if the 
system is a one-dimensional harmonic oscillator, the property 'having energy between E\ and E-i is associated with 
an elliptical ring in phase space, while the property 'having position between x\ and x-i is associated with a vertical 
band. Suppose the property s is associated with a subset of phase space. The idempotent variable associated with 
the property s is the function Fa that takes the value 1 for every point in O and for every point outside Q. If two 
properties s and s' are represented by subsets f2 and f2' of phase space, then the disjunction of s and s' is represented 
by the union of f2 and O', the conjunction of s and s is represented by the intersection of f2 and f2', and the negation 
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of s is represented by the complement of tt. The truth tables appropriate for conjunction, disjunction and negation 
in classical logic place the following constraints on the values of the idempotent variables: 

-Fhufi' = Fn + Fq> — fh-Fh' 

Fnnn' = FqFq/ 

% = 1 - Fa, 

where U, R and an over-bar respectively denote union, intersection and complementation of subsets of phase space. 

We now consider an approach to realist interpretations of quantum mechanics that parallels those features of 
classical theories described above, except of course that the mathematical structure relevant for the description of a 
system is no longer phase space but Hilbert space. This approach has its origin in the field of quantum logic JTl| , and is 
adopted by most modal interpretations. A variable is represented by a Hermitian operator over the Hilbert space, and 
its possible values are the eigenvalues of this operator. Properties are associated with subspaces of Hilbert space. The 
idempotent variable associated with a given property is represented by the projector onto the corresponding subspace. 
If two properties s and s' are represented by subspaces S and S' , then the disjunction of s and s' is represented by the 
linear span (direct sum) of S and S' , the conjunction of s and s' is represented by the intersection of S and S' , and 
the negation of s is represented by the orthogonal complement of S. These assumptions will be called the constraints 
on logical connectives. 

For simplicity, we denote both the projector onto the subspace S and the associated idempotent variable by Ps, 
and denote the value of this idempotent variable by [Ps]- Analogously to the classical case, we adopt the following 
constraint on the values of the idempotent variables: 

Functional relation constraint 

[Pses>] = [Ps] + [Ps>]-[Ps][Ps'] (1) 
[Psns>] = [Ps][Ps>] (2) 
[P S ±] = 1-[P S ], (3) 

where ©, n and denote respectively linear span, intersection, and orthogonal complement. 

As it turns out, it is impossible to associate values with all the projectors in a Hilbert space in a way that is 
consistent with the functional relation constraint and the constraints on logical connectives |l2|. The response of 
modal interpretations is to associate definite values with only a subset of all the projectors in the Hilbert space. 
Thus, in contrast with classical mechanics, only a subset of all idempotent variables correspond to well-defined 
properties at any given time. The projectors that are associated with definite values are labelled determinate, as 
are the corresponding idempotent variables. The functional relation constraint is only required to hold among the 
determinate projectors. 

As regards non-idempotent variables, we adopt the convention that a variable is denoted by the same symbol as the 
associated hermitian operator, for instance V, and that its value is denoted by [V]. Moreover, following other modal 
interpreters Jl3[ , we adopt the attitude that if the spectral resolution of a Hermitian operator is V = VkPs k > where 
Vk 7^ for all k, and Vk 7^ Vk' for k 7^ k', then V is determinate if and only if all of the projectors in the set {Ps k }k 
are determinate, and in this case its value is [V] = Vk[Ps k ]- We call this the spectral constraint. 

It follows from this constraint that the set of determinate idempotent variables and their values are sufficient to 
specify the set of all determinate variables and their values. As noted in the introduction, we will refer to the set of 
determinate variables as the ontology, and the values of these variables as the value ascription. The ontology and the 
value ascription together define the property ascription. 

We turn now to constraints on the nature of the ontology. It is typically assumed that logical combinations of well- 
defined properties are also well-defined. Thus, if property s is well-defined, then so too should be the property 'not 
s\ In other words, if Ps is determinate, then P s ± (= I — Ps) should be determinate as well. Similarly, if properties 
s and s' are well-defined, so that the associated projectors Ps and Ps' are determinate, then the properties 's or s" 
and 's and s" should also be well-defined, and the associated projectors Ps&S' an d PsnS' should be determinate. In 
summary, we require 

Closure constraint: 

If P s 6 Out then P s ± E Out 

If {P s ,Ps>} € Out then {P S(S s>, PsnS'} € Ont, 

where Ont denotes the ontology. Assuming that for every system there is at least one projector that is determinate, 
it follows from the closure constraint that the identity operator and the projector onto the null space are determinate 
for every system. 

We refer to all of the constraints on the ontology and value ascription that have been presented thus far as the 
algebraic constraints. 
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B. Preferred Projectors 



We now derive a few consequences of the algebraic constraints. For this purpose, it is useful to introduce a notational 
convenience: l Pg > Ps'' denotes that 5' is a subspace of S, and ( Ps -L Ps 1 ' denotes that <S and S' are orthogonal 
subspaces. First, we show that in every property ascription there is a projector P$ that receives the value 1, and for 
which no projector onto a proper subspace of S receives the value 1; that is, there is a subspace <S such that 

[Ps] = 1 and there is no subspace T such that Pr < Ps and [Pr] = 1. (4) 

Such a subspace always exists since at least one projector, namely the identity operator, always receives the value 1. 
Such a subspace is unique because if S and S' were two distinct subspaces satisfying this definition, then both would 
receive the value 1, and by the functional relation constraint their intersection would also receive the value 1, which 
implies that either S or S' has a proper subspace that receives the value 1. We call the unique projector satisfying 
Eq. (Q) the preferred projector for the property ascription. The property associated with this projector is also called 
preferred. 

A modal interpretation must account for the fact that a measurement device may have one of several different 
properties at the end of a measurement despite there being a single state vector for the universe. This is accomplished 
by assuming that the property ascription is not fixed by the state vector, but may be one of several possibilities 
at a given time. There are two notions of possibility that are adopted by modal interpreters in this context. In 
the first, the different possibilities for the final property ascription to the measurement device are attributed to 
differences in the initial properties (possibly hidden) of the system. In the second, the different possibilities for the 
final property ascription to the measurement device arise from an objective stochasticity in the evolution of the 
property stateQ. Although we adopt the latter view in subsequent sections, for the present it suffices to note that in 
all modal interpretations one associates with a system, at every time, one out of a set of several possible property 
ascriptions. Since each such property ascription defines a preferred projector, there is in general a set of preferred 
projectors associated with a system at a given time. We now consider the relation between the elements of this set. 

We begin with a definition: two properties are said to be mutually exclusive if their conjunction is a contradiction. 
This is stronger than simply being distinct, as is illustrated by the properties 'red' and 'red or blue', which are distinct, 
but not mutually exclusive. We also use the term 'mutually exclusive' to describe two property ascriptions if there is 
a property obtaining in one that is mutually exclusive to a property obtaining in the other. We assume that for all 
systems at all times, the different possible property ascriptions are mutually exclusive. This assumption can be recast 
as a constraint upon the set of preferred projectors. Recalling the constraints on logical connectives, for two property 
ascriptions to be mutually exclusive there must be two projectors P-r and Pr,/ such that [Pr] = 1 in the first property 
ascription and [Pr.] — 1 in the second, but such that the intersection of 1Z and TV is the null space. By definition, 
the preferred projector for the first property ascription, call it Ps, must be such that Ps < P-r, and the preferred 
projector for the second, call it Ps>, must be such that Ps< < Pr<, from which it follows that the intersection of S 
and S' must also be the null space. Thus we conclude that the preferred projectors for mutually exclusive property 
ascriptions are associated with subspaces whose intersection is the null space. Note that we have not concluded that 
the preferred projectors are orthogonal. Indeed, the possibility of preferred projectors that are non-orthogonal will be 
the focus of much of this paper. 

III. NON-ORTHOGONAL PREFERRED PROJECTORS 

A. The Faithfulness Criterion 

One can debate the merits of assuming orthogonal preferred projectors in the context of a macroscopic system 
such as the pointer on a measurement device. On the one hand, the distinguishability of the different physical states 
of a pointer suggest that they must be associated with orthogonal projectors; on the other hand, the requirement 
of quantum-classical correspondence suggests that the alternative positions of a pointer should be associated with 



1 An example of the first approach is the Bub-Clifton interpretation when the preferred variable has a continuous unbounded 
spectrum, and the dynamics is given by a guidance equation analogous to the one used in Bohmian mechanics. An example 
of the second approach is the Bub-Clifton interpretation when one adopts a different guidance equation, or when the preferred 
variable has a discrete spectrum. See, for instance, section 5.2 of Ref. [H] . 
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projectors onto a set of coherent states, or some similar over-complete and non-orthogonal basis. We do not take a 
stand on this issue here. We do provide an argument for adopting a non-orthogonal set of preferred projectors, but it 
appeals to the properties that should be assigned to microscopic rather than macroscopic systems. In particular, we 
consider a microscopic system that is the object of investigation in a quantum measurement. 

Since operational quantum mechanics only makes reference to the properties of macroscopic preparation and mea- 
surement devices, the requirement of agreement with the operational theory does not by itself constrain the properties 
that are assigned to the microscopic systems under investigation. But if the properties of such microscopic systems 
are to play any explanatory role in the theory, then one would expect their role to be in determining the outcomes 
of measurements upon them. In particular, we consider the following criterion for assigning determinate status to a 
variable: 

Faithfulness Criterion If it can be predicted with probability 1 that a measurement of the variable V will yield 
the result v, then immediately prior to the measurement the variable V is determinate with value v. 

The motivation for adopting this criterion is presented in the next subsection. It is nonetheless worth emphasizing 
at this point its importance in this paper: the particular form of the framework for modal interpretations that is 
presented in section 4 and the particular proposal presented in section 5 are both to a large extent attempts to satisfy 
the faithfulness criterion. 

This criterion is applicable to experiments involving a sequence of measurements, the first of which may be con- 
sidered a preparation. Within the context of operational quantum mechanics, we consider a sequence of two mea- 
surements, associated with distinct Hermitian operators, V and V, both belonging to the Hilbert space TL S . For 
simplicity, we assume that these operators have non-degenerate eigenvalues, denoted by {vk}™ =1 and {v'j}jL 1 respec- 
tively, and eigenvectors denoted by {|^>fc)}fcLi and {j^)}^! respectively. In order to predict the outcome of the 
second measurement, it is necessary to also specify how, if at all, the first measurement disturbs the state. Suppose 
then that upon obtaining outcome k for the first measurement, the state \(pk) is prepared, and that the set of vectors 
{|<Pfc)}feLi, although normalized and non-collinear, arc non-orthogonal. Since it follows that \(pk) ^ \fk) f°r onc or 
more values of k, we say that the measurement is disturbing. For simplicity, the second measurement is assumed 
to be non-disturbing, and the two measurements are assumed to be immediately consecutive. Finally, we take the 
preparation procedure that precedes the first measurement to be associated with a state vector c& \<fk) > where 

J2k \ c k\ = !> and c fe f° r a11 k - 

Operational quantum mechanics predicts, via the generalized Born rule, that the probability of the second apparatus 

indicating outcome j given that the first apparatus indicates outcome k is 

Prob(j|fc) = |(^)| 2 . 

It follows that in order for the outcome of the second measurement to be predictable with probability 1 given the 
outcome of the first measurement, it must be the case that (<pk\<Pj) — 1 for some values of k and j. Hence, the variable 
V measured by the second apparatus must have at least one of the states in the set {l^fc)}^! as an eigenstate. We 
define a set of variables {V(fc)}fcLij such that the variable Vtk) has \ipk) as an eigenstate. In particular, we denote the 
eigenvalues of V^) by {v(k)j}jLi, and the associated eigenvectors by {\f(k),j)}jLn and take |^(fc),i) = \<Pk) ■ It then 
follows that if the first measurement has outcome k, so that \(pk) is prepared, and if V = V(k), that is, the second 
apparatus measures V/m, then with probability 1, the second measurement has the outcome 1. Hence the faithfulness 
criterion is applicable in this case, and implies that V( k ) is determinate with value V( k ),i at time t, immediately prior 
to the second measurement. 

We now introduce a critical assumption about the sequence of measurements: the nature of the second measurement 
is taken to depend on the outcome of the first. In particular, we imagine a set-up where if the outcome of the first 
measurement is k, then the second apparatus measures the variable V(uy, we imagine that this is done mechanically 
by the measurement apparatus, without the intervention of a physicist. In this case, the faithfulness criterion is 
applicable for all possible outcomes of the first measurement. 

We now show that for the faithfulness criterion to be satisfied for such a sequence of measurements, the preferred 
projectors must be non-orthogonal. Since there is a non-zero probability for the first measurement to have the 
outcome k for every k, it follows from the faithfulness criterion that there is a non-zero probability for the system 
to possess the property [V^)] = v (k).i immediately prior to the second measurement, for every k. If a property has 
non-zero probability of being possessed, it is a possible property. Since the projector P ( f, k associated with the property 
[V(k)} = v {k).i is one-dimensional, it has no non-null proper subspaces, and thus by definition it is the preferred 
projector for the feth property ascription. Thus, the set of preferred projectors is {-P<^}fcLi- Finally, since the set of 
vectors {\<fik)}k%i is by hypothesis non-orthogonal, the set of preferred projectors must also be non-orthogonal. 
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For clarity, we briefly repeat this argument in the context of a simple example, illustrated in Fig. 1. Suppose the 
variables being measured correspond to the components along different spatial axes of the spin operator, S, for a spin 
1/2 particle. Denote the component along axis n by S • n, and the eigenstate associated with eigenvalue ±H/2 by 
±n) . Suppose the first measurement is along z, so that V = S • z, \ipi) = |+z) , and ^2) = |— z) • Suppose moreover 
that the state |+x) is prepared if the outcome of the first measurement is — z, while no disturbance occurs if the 
outcome is +z, so that = |+z) , and \<p^) = |+x) . Now suppose that the manner in which the nature of the 
second measurement depends on the outcome of the first is the following: if the first measurement has outcome +z, 
then the second measurement is of S ■ z, while if it has outcome —2, then the second measurement is of S • x. Thus, 
Vm = S ■ z and V(2) = S ■ x. Assume the initial state of the spin is c\ |+z) + c 2 \— z) , where |ci| 2 + | C2 1 2 = 1 and 
ci,c 2 ^ 0. 

Using the generalized Born rule, it is straightforward to verify that the result of the second measurement is pre- 
dictable with probability 1 given the result of the first measurement. It follows from the faithfulness criterion that 
if the outcome of the first measurement is +z, then S • i is subsequently determinate with value +K/2, and if the 
outcome of the first measurement is — z, then S • x is subsequently determinate with value +K/2, Since both of these 
options occur with non-zero probability, the properties [S • z] = +H/2 and [S • x] = +H/2 are both possible. It follows 
that the preferred projectors are P+± and P+a, which are non-orthogonal. 



B. Motivation 

We now consider the reasons for adopting the faithfulness criterion. As is argued by Redhead |Q, in seeking a 
realist interpretation of quantum mechanics one is seeking an explanation of the successes of the operational version 
of the theory. One way to secure an explanation of a measurement outcome is to demand that the properties of the 
systems involved ensure this outcome. The faithfulness criterion goes beyond this however, in that it specifies the 
form that such an explanation must take. Specifically, it is assumed that the reason a variable V is found to have 
value v in a measurement that is predictable with probability 1 is because immediately prior to the measurement V 
is determinate and has value v. Although this is perhaps the simplest form the explanation could take, it is not the 
only form, as is evidenced by Bohm's theory |l5j and Bell's be-able interpretation 0], where only the outcomes of 
measurements of certain variables (position in Bohm's case and lattice fermion number in Bell's case), are taken to 
reveal pre-existing values of these variables. 

So we see that the faithfulness criterion is not a necessary feature of a realist interpretation. Nonetheless, there 
have been many attempts to ensure that the outcomes of perfectly predictable measurements do reveal pre-existing 
values of these variables. 

This tradition dates back to von Neumann, who assumed that the determinate variables of a system and their 
values are fixed by the density operator for the system, p(t), by what we shall call the orthodox rule, namely, 

Ont(t) = {V\Vp{t) oc p(t)} 

[V] t = Tr{Vp{t)), (5) 

where Ont(t) indicates the ontology at time t, and [V]t indicates the value of V at time t (this is simply the rule 
adopted by the 'orthodox' realist interpretations discussed in the introduction). Given that after a non-disturbing 
measurement of the variable V with outcome v, one can predict, with probability 1, that the outcome of an immediately 
consecutive ideal measurement of V will also be v, the faithfulness criterion demands that V be determinate with 
value v prior to the second measurement. However, given the orthodox rule, this can only occur if the density operator 
after the first measurement is a projector onto an eigenstate of V associated with eigenvalue v. This must be the case 
regardless of the density operator prior to the first measurement. Thus, in order to satisfy the faithfulness criterion, 
von Neumann assumed that upon measurement the state vector undergoes a non-unitary evolution (the so-called 
'collapse') to the eigenvector associated with the outcome of the measurement. As a realist interpretation of quantum 
mechanics this proposal is at best incomplete since it fails to specify, in terms of the primitives of the theory, the 
conditions under which a collapse occurs. 

Many modal interpretretations also attempt to satisfy the faithfulness criterion, but unlike von Neumann, they 
abandon the orthodox rule rather than assuming collapse. For instance, it has occurred to many authors, including 
Kochen |l6|], Healey |ij and Dieks that by assigning determinate status to the projectors in the spectral resolution 
of the density operator one can satisfy the faithfulness criterion for ideal measurements. Modern versions of this 
approach include the proposals of Vermaas and Dieks |lq |, and Bacciagaluppi and Dickson [Q. However, it was noted 
by Bacciagaluppi and Hemmo that the Vermaas and Dieks proposal failed to satisfy the faithfulness criterion for 
certain non-ideal measurements, specifically, disturbing measurements. The same argument can be applied against 
the Bacciagaluppi and Dickson proposal. 
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These results do not rule out the possibility that some new proposal involving a different, but still orthogonal, choice 
of preferred projectors might satisfy the faithfulness criterion for non-ideal measurements. However, by considering an 
experiment wherein the nature of the second measurement depends on the outcome of the first, we have shown that 
the faithfulness criterion fails to be satisfied for any modal interpretation that adopts orthogonal preferred projectors. 

Thus any modal proposal seeking to satisfy the faithfulness criterion must allow for non-orthogonal preferred 
projectors. However, a satisfactory proposal must provide an unambiguous rule for identifying the set of preferred 
projectors for every system at every time, and it remains to be seen whether there exists any rule that consistently 
satisfies faithfulness. This rule must also satisfy other constraints, such as predicting properties for macroscopic 
systems that are in accord with our everyday perceptions of them. In particular, it must yield a solution to the 
measurement problem. It may be that the preferred set cannot be chosen to satisfy faithfulness for all measurements 
while also satisfying these other constraints. If this were true, it would certainly remove some of the motivation for 
pursuing a modal interpretation in the tradition of the authors specified above. We are not able to rule out this 
possibility here. Nonetheless, the range of measurements for which faithfulness is satisfied can at least be expanded 
if one assumes non-orthogonal preferred projectors, as we demonstrate in section 5 by a specific proposal, 

C. Consequences for the ontology 

In a modal interpretation, the property ascription to a system at a given time can be one of several possibilities. 
A question which we now address is whether or not these possibilities should differ with respect to the ontology they 
ascribe. Most previous modal interpreters have assumed that they should not. In such interpretations, the possible 
property ascriptions differ only with respect to the value ascription to a single common ontology. However, as we now 
prove, such an approach is unable to accomodate non-orthogonal preferred projectors. 

Theorem 1 It is not possible for there to be, at a given time, several possible mutually exclusive property ascriptions 
which (1) satisfy the algebraic constraints, (2) do not differ with respect to ontology, and (3) are associated with 
preferred projectors that are non-orthogonal. 

Proof. The proof is by contradiction. Suppose the ontology and preferred projector for the fcth property ascription 
(in the set of possible property ascriptions at a given time) are denoted respectively by Ontk and Pfc. Since by hypoth- 
esis the possible property ascriptions do not differ with respect to ontology, there exists a single set of determinate 
variables, denoted by Ont, such that Vfc : Ontk — Ont. Since Vfc : P& E Ontk, it follows that {PfcjfcLi S Ont. In other 
words, if there is only a single possible ontology at a given time, the preferred projectors for all the different property 
ascriptions must simultaneously be part of this ontology. Since one of the possible property ascriptions must actually 
obtain, one of these projectors must receive the value 1. Moreover, since by hypothesis the preferred projectors are 
non-orthogonal, it follows that the common ontology includes several non-orthogonal projectors, one of which receives 
the value 1. However, this is in contradiction with the algebraic constraints, as we now demonstrate. 

Suppose that Ps and Ps> are two non-orthogonal preferred projectors, and that [Ps] = 1. Since the two property 
ascriptions associated with these are by assumption mutually exclusive, the intersection of S and S' is the null space. 
Moreover, by the definition of a preferred projector, no non-null proper subspace of S or S' can be determinate. By 
closure and the fact that Ps< is determinate, P(s>) ± 1S also determinate. By closure and the fact that Ps and P(S') ± 
are determinate, the projector onto the intersection of (<S')~ L and S must also be determinate. Since no non-null 
proper subspaces of S can be determinate, the intersection of (S') and S cannot be a non-null proper subspace of 
S. Moreover, this intersection cannot be S itself, since then S and S' would be orthogonal, contradicting our initial 
assumption. Thus, the intersection of (S') ± and S must be the null space. It then follows from the functional relation 
constraint that [P(s')x][Ps] = 0, and since [Ps] = 1, this implies that [P/gnx] = 0. It also follows from the functional 
relation constraint that [P(s')x] = 1 — [Ps'L so that [Ps'] = 1. Thus, both Pg and Ps< receive the value 1. But, this 
is in contradiction with [Ps][Ps'] = which follows from the fact that the intersection of S and S' is the null space. 
QED. 



IV. AN INTERPRETIVE FRAMEWORK INCORPORATING NON-ORTHOGONAL PREFERRED 

PROJECTORS 

A. Preliminaries 

In the previous section it was established that in order to consider a modal interpretation with non-orthogonal 
preferred projectors, the different possible property ascriptions to a system must differ with respect to the ontology 
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they ascribe. The precise form of the ontology associated with a particular property ascription has not yet been 
specified. It turns out that this form is fixed if an additional constraint on the property ascription is adopted, namely, 



Weakening Condition If P s e Ont and [P s ] = 1 then for all Pn such that P n > P s . P n € Ont and [Pn] = 1. 

This condition was introduced by Healey J^] and was resurrected recently by Vermaas |^(|. It is called 'weakening' 
since in Healey 's terminology Pn is said to be weaker than Ps if Pn > Ps- It is motivated by the same sorts of 
considerations that lead one to adopt the closure constraint and the functional relation constraint; it is an attempt 
to preserve the logical structure of classical mechanics. In the language of properties, the weakening condition states 
that if property s is well-defined and holds for the system, then any property implied by s, namely any property of the 
form 's or s" should also be well-defined and hold for the system. It should be noted that the weakening condition is 
unlike previous constraints, insofar as the nature of the ontology is made to depend on features of the value ascription. 
We now demonstrate the form of property ascription that results from adopting the weakening condition. 

Theorem 2 The algebraic constraints and the weakening condition imply that the set of determinate projectors and 
the value ascription to these must respectively have the forms 



where Ps is the preferred projector for the property ascription. 

Proof. Recall that the preferred projector Ps is the unique projector in the property ascription satisfying Eq. 
(|J), so that [Ps] = 1 and there is no subspace T such that Pj- < Ps and [Pt] = 1- By the weakening condition, 
[Ps] = 1 implies that the set of projectors {Pr|Pr > Ps} is determinate. Moreover, for any projector Pu orthogonal 
to Ps, Pu + Ps is determinate, since (Ps + Pu) G {Pn\Pn > Ps}- It then follows from the constraint of closure that 
(I — Ps)(Ps + Pu) = Pu is determinate. Thus, all projectors orthogonal to Ps, namely the set {Pu\Pu -L Ps}, are 
also determinate. In summary, all the projectors in the set {Pn\Pn > Ps or Pn -L Ps} must be determinate. We 
now show that the projectors in this set are the only projectors that are determinate. 

Suppose the contrary, namely that there exists a determinate projector Py such that Py ^ Ps and Py /LPs- If 
Pv ^ Ps then the intersection of V and <S is not equal to S, and must therefore be a proper subspace of S. It then 
follows from the functional relation constraint and the assumption that all proper subspaces of S receive the value 
that [Py][Ps] = 0. Since [P s ] — 1, we conclude that [Py] = 0. Moreover, since Py /LPs is equivalent to (I — Py) ^ P s , 
it follows by the same argument that [I — Py] = 0. But [I — Py] = implies [Py] = 1, thereby yielding a contradiction. 

Finally, we demonstrate that the value ascription must be of the form of §7§. Given that [Ps] = 1, it follows 
trivially from the weakening condition that [Pn] — 1 if Pn > Ps- Moreover, [Pn] — if Pn -L Ps since otherwise the 
intersection of TZ and S, which is the null space, would receive the value 1. QED. 

Theorem 2 identifies the set of idempotent variables that are determinate given the weakening condition and 
the algebraic constraints. The set of non-idcmpotcnt variables that are determinate then follows from the spectral 
constraint. Specifically, we have 

Corollary If the set of determinate projectors and their values are given by Eqs.(||) and (^), then the spectral 
constraint implies that the ontology and its value ascription are given by 



where Ps is the preferred projector for the property ascription. 

Proof. The spectral constraint states that a non-idempotent variable V is determinate if and only if all the 
elements of its spectral resolution are determinate. Thus every variable V that is determinate has a spectral resolution 
V = J2k ^kPn k , where Vfe : (Pn k > Ps or Pn k -L Ps)- But the latter condition is equivalent to Vfc : Pn k Ps K Ps, 
from which it follows that VPs oc P s . The spectral constratint also states that [V] = J2k ^k[Pn k ]- Since the Pn k are 
orthogonal, only one can satisfy Pn k > Ps, and thereby receive the value 1 by Eq. ([?]). Labelling this projector by k', 
we have [V] = \ k >, and VP S = \ k >Ps, from which Eq.(|) follows. QED. 

The corollary to theorem 2 states that a variable is determinate if it has the subspace associated with the preferred 
projector as an eigenspace, and the value of this variable is the associated eigenvalue. This has the form of the 
orthodox rule, defined in Eq. (0), but where the role of the density operator is played by the preferred projector. 



{Pn\Pn>Ps or P n ±Ps} 



(6) 




(7) 



Ont = {V\VP S oc Ps} 
[V]=Tr(VP s ), 



(8) 
(9) 
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Note that for systems of dimensionality 3 or greater, theorem 2 implies that the ontologies associated with mutually 
exclusive property ascriptions are necessarily distinct. This holds true for such systems even if the preferred projectors 
for the property ascriptions are orthogonalQ In this sense, the weakening condition provides another reason, indepen- 
dent of the one provided in section 3.3, for allowing the possible property ascriptions to differ in ontology. Such an 
arg ument was in fact made by Vermaas in the context of the Vcrmaas and Dicks version of the modal interpretation 

Thus far, we have focused upon the property ascriptions for individual systems, and nothing has been said concerning 
the relationship between the property ascriptions to composite systems and the subsystems of which they are formed. 
Clifton jfj] has argued for the following constraint on this relationship, which we call the reductionist rule: 

V A e Ont A if and only if V A <g> I B £ Ont AB , and [V A ] = [V A <g> I 3 }, (10) 

where Ont A is the ontology of system A 1 and AB is the composite of systems A and B (i.e. A{B) denotes the 
system associated with Hilbert space H A (Ti. B ), and AB denotes the system associated with 7i A H B ). Denying this 
constraint leads to what Clifton has called ontological perpectivalism, the view that what exists depends on the level 
of compositeness of the description. For instance, to deny the 'only if half of the rule amounts to claiming that it is 
possible for part A of a composite to have the property s, while the composite itself does not have the property that 
part A has property s. Clifton has characterized such a position as 'metaphysically untenable'. 

If one adopts the reductionist rule, the property ascriptions for all subsystems are uniquely fixed by the property 
ascription for the composite. However, we also wish to assume that the property ascriptions for every system satisfy 
the algebraic constraints and the weakening condition. It has yet to be demonstrated that these constraints are 
consistent with the reductionist rule. In fact they are. Specifically, if the property ascription for the composite has 
the form given in Eqs. (||) and (^|), with a preferred projector denoted by P AB , then the property ascription for 
subsystem A also has the form given in Eqs. (|^) and (^), where the preferred projector, denoted by Pg, is the unique 
projector satisfying 

Ps B <P£®I B and there is no subspace T A such that Pg B < P^ <Z> I B < Pg <g> I B '. (11) 

(In other words, Pg is the 'smallest' projector satisfying Pg ®I B > Pg B ) This can be shown to be a limiting case of 
a result by Dickson and Clifton H], however for clarity we prove it directly. It suffices to demonstrate the following 
equivalences: 

{P A \P A ®I B > P AB } = {P A \P A > P A } 

and 

{P A \ p a ®i b ± p ab } = {p a \p a j_ p£}, 

where Pg and Pg B are related as above. We first demonstrate that the right hand sides imply the left. P£ > Pg 
trivially implies P£ <S> I B > Pg <g> I B ', and since by definition, Pg <g> I B > P£ B , it follows that P£ <g> I B > Pg . 
Similarly, Pg _L Pg trivially implies P£ <g> I B 1 P^ <g> I B , and together with P£ ® I B > P£ B ', this implies that 
P-r ® P 3 -L Ps B ■ To show that the left hand sides imply the right, we make use of the fact that Pg <g> I B is the 
'smallest' projector satisfying Pg (8 I B > Pg B ■ This implies that any projector P^ satisfying P^ ® I B > Pg B must 
also satisfy P£ ® I B > Pg ® I B , and hence P£ > Pg - In addition, any projector P£ satisfying P^® I B _L Pg B 
(equivalently (I A - P£) ® I B > P^ 3 ) must also satisfy (I A - P£) ® I B > Pg ® I B which implies P^ ± Pg - This 
concludes the proof. 

Clearly, the preferred projectors for a subsystem can be non-orthogonal if the preferred projectors for the composite 
are non-orthogonal. What is perhaps more surprising is that the preferred projectors for a subsystem can be non- 
orthogonal even if the preferred projectors for the composite are not! For example, suppose the preferred projectors 
for the composite are two orthogonal projectors, P AB — P A (g> P B and P£ B — P A ® P B , where P B and P B are 
orthogonal projectors, but P A and P A are not. It then follows from Eq. ( pj| ) that the preferred projectors for A are 
simply P A and P A , which are non-orthogonal. 

In the next subsection, we will introduce a framework for interpretation wherein the possible property ascriptions 
for the universe are defined first, in accordance with the algebraic constraints and the weakening condition, and the 



2 This is not true for a 2-dimensional Hilbert space, since two distinct property ascriptions can be associated with the same 
ontology. This occurs when the preferred projectors for these property ascriptions are orthogonal. 







possible property ascriptions for all subsystems are then inferred using the reductionist rule. The preferred projectors 
for the universe will be assumed to be orthogonal, but as shown above, this is consistent with the preferred projectors 
for a subsystem being non-orthogonal and hence does not rule out the possibility of satisfying the faithfulness criterion. 
A more general approach would be to assume a non-orthogonal preferred set for the universe as well. However, the 
faithfulness criterion does not necessitate this assumption, and indeed, as we will demonstrate in section 5, one can 
satisfy this criterion for a wide variety of measurements without it. The case of an orthogonal preferred set for the 
universe is in any event a natural place to begin such an investigation. 

The framework that emerges is similar to the one proposed by Bub and Clifton flpfl . The most significant difference 
is in the form of the property ascription, since the latter do not assume the weakening condition. Another difference 
is in the dynamics of the property ascription. Bub and Clifton defined a dynamics following Vink p|| and Bell [||. 
This approach was subsequently generalized in two respects by Bacciagaluppi and Dickson j7j, and Dickson First, 
the preferred projectors were allowed to be time-dependent, and secondly it was shown that there is a plurality of 
possible dynamics consistent with the quantum statistics. We follow the latter, generalized approach. 

Since many of the ingredients of the framework derive from a number of sources, and since we introduce some novel 
terminology, we have written the rest of this section in such a way that it constitutes a self-contained description of 
the framework. 



B. Details of the framework 



It is assumed that the universe is associated with a Hilbert space Ti. and a vector \ip(t) ) G H that evolves determin- 
istically over time in accordance with the Schrodinger equation, 

±m))=-iH\i,(t)}, (12) 

where H is the total Hamiltonian, and where the units are chosen such that h = 1. Since, as will be demonstrated 
shortly, the role of the vector \4>(t)) in the framework is to determine the probabilities of various different property 
ascriptions as well as their dynamics, it will be dubbed the dynamical state vector. 

Define a decomposition D of a vector \tp) as a set {(c)., °f non-zero coefficients Cfc and orthonormal 

vectors \(f>k) such that \ip) = J^feLi c fc l<^fc) • It is assumed that every interpretation within the framework selects a 
preferred decomposition of the dynamical state \ip(t)) at every time t. The projectors onto the elements of the preferred 
decomposition constitute the preferred projectors for the possible property ascriptions to the universe. 

We also introduce a new 'state vector' that we denote by |$(t)) . It can be any one of the vector elements of the 
preferred decomposition. The projector onto this vector is the preferred projector for the property ascription to the 
universe that obtains at time t. Assuming the algebraic constraints and the weakening condition, it follows from 
Theorem 2 that the property ascription for the universe has the form of Eqs. (||) and (|^), which may be rewritten in 
terms of |$(f)) as 

Ont{t) = {7|V|$(t)) oc|*(i)>}, 

\v\ t = m)\vm))- 

The property ascription to any subsystem of the universe is then fixed by the reductionist rule, defined in Eq. (JlCj) . 
Since | determines the property ascription to every system, we call it property state vector. 

Next, we introduce a restriction on the manner in which the elements of the preferred decomposition can evolve 
over time. Suppose the set of vectors {\(t>k(t)}}t = i at every time t is a complete orthogonal basis for TL that includes as 
a subset the vector elements of the preferred decomposition. We require that there is an indexing of the basis vectors 
such that every vector with a given index is an analytic function of time. We call this the constraint of analyticity. 
It can be satisfied by requiring that the time-dependent vectors in the set {|0fe(i))}fe = i each define a path through 
Hilbert space obeying the equation 

j t \ct> k {t)) = -iH{t)\<t> k {t)), (13) 

for some Hermitian operator Hit). It is convenient to refer to these vectors, considered as functions of time, as the 
preferred paths. 

It is assumed that the property state vector evolves according to a Markovian stochastic dynamics that permits 
hopping among the preferred paths. We require that at every time t the probability pk(t) that the property state 
vector lies on the kth preferred path is given by 
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Pk (t) = \{Mt)m))\ 2 



(14) 



The latter requirement is called the Born rule constraint. Although the basis {|0fe(f)}}fc=i that is defined at time t 
by the preferred paths may include elements that are not part of the preferred decomposition, these elements have no 
overlap with \ip(t)) , so that the probability associated with them is zero. It follows therefore that the property state 
vector always corresponds to one of the vector elements of the preferred decomposition. There are many dynamics 
that satisfy the Born rule constraint; these will be considered in the next subsection. 

We refer to this entire interpretive structure as a 'framework' for modal interpretations, since there are a plurality 
of possible interpretations that have this form. Specifically, there is a different interpretation for every choice of rule 
for determining the preferred decomposition and every choice of dynamics that satisfies the Born rule constraint. 



C. The general form of the dynamics 

We now recall the general form of a Markovian stochastic dynamics that satisfies the Born rule constraint [Q . This 
constraint, articulated in Eq.(|T4|), can be recast as constraints upon the initial conditions [ p4[ and the dynamics: 

Pk(o) = |<<M0)|V(0)}| 2 , 



and 



j t p k (t) = j t \{Mt)\m)\ 2 - as) 



Using Eqs.(^2|) and (|l3|), the latter becomes 

j t p k (t) = 2lm[(m\Mt)) (Mt)\H - H(t)\m)} ■ (16) 

Since we assume Markovian dynamics, it is sufficient to specify the probability T k j (t)dt of a transition from path 
j to path k during the infinitesimal interval between t and t + dt, for all j and k. The evolution of a probability 
distribution p k (t) over the paths is then given by the master equation 



j t Vk(t) = PWOjC*) - T jk {t) Pk {t)\ . 

3 

In what follows, we consider the problem of finding a set of functions T k j(t) that satisfy the master equation given 
p k (t). Following Bell it is useful to define a new set of functions, namely a set of probability currents, Jkj(t), as 
follows: 

J kj (t) = T kj (t) Pj (t) - T ]k (t) Pk (t). (17) 

The current Jkj{t) describes the net flow of probability from path j to k at time t. This definition implies that the 
current is antisymmetric with respect to an interchange of its indices 

Jfei(i) = ~Jjk(t). (18a) 
In terms of these currents, the master equation becomes a continuity equation: 

j t P k {t)=Y,JkM (19) 

3 

Following Bacciagalup pi a nd Dickson B, one can solve for the T k j(t) in two steps. First, one finds a set of currents 
Jkj(t) that satisfy Eq. (|l8a| ) and that solve Eq.([l9|) with dp k (t)/dt given by Eq. ([l6|) . Next, one finds a set of functions 
T k j(t) that solve Eq.(|17|) given a particular solution for J k j(t). It turns out that there an infinite number of sets of 
antisymmetric currents which solve the continuity equation. Moreover, for a given set of currents, there are an infinite 
number of solutions for the Tkj(t), specifically, any set of functions that satisfy 

T kj >max{0,^}, (20) 
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and 



= (T kjPj J k] ) _ (21a) 
Pk 

for every pair of indices k > j. 

So we see that there is a large number of solutions for the dynamics which satisfy the constraints introduced. It is 
possible that additional constraints, such as a requirement of quantum-classical correspondence, might eliminate the 
ambiguity in the choice of dynamics, but this has yet to be demonstrated and some authors argue that it is unlikely 



V. THE MINIMAL ENTROPY PROPOSAL 
A. Details of the proposal 

We begin by introducing some terminology. A factorization F of a Hilbert space TL is defined to be a set of 
Hilbcrt spaces each of dimensionality greater than one, the direct product of which is TL, that is, F — {TL^}p =1 , 
such that U = TL {1) <g> TL (2) ■ ■ ■ <g>W (n) , and dhn(H&>) > 1. A more precise definition of this concept is supplied 
by Bacciagaluppi |2j|, but this is not required for our purposes. A factorization containing n elements is called 
n-partite, and the elements themselves are called factor spaces. A factorization F is said to be a coarse-graining of a 
factorization F' , and F' a fine-graining of F, if F' can be generated from F by factorizing one or more of the elements 
of F. Finally, a product decomposition of \tp) with respect to the factorization F = {Ti^}p =1 is any decomposition 

{( c fc) ^ir'^OIfcLi 01 IV'} every element of which is a product state over F. 

The first element of the proposal is to assume that there is a factorization of the Hilbert space of the universe that 
is more physically relevant than the others; we call it the distinguished factorization . There is a precedent for such 
an assumption, specifically, in the modal interpretations of Healey ||, Bacciagaluppi and Dickson and Dieks p6[ . 
Such interpretations have been called 'atomic', since the factor spaces of the distinguished factorization represent 
the most elementary physical systems. Some restrictions on what the distinguished factorization could be will be 
discussed briefly in section 5.2. 

The first constraint upon the preferred decomposition is that it be a product decomposition with respect to the 
distinguished factorization. This constraint is not sufficient to uniquely specify a decomposition. Indeed, the number 
of product decompositions of any state vector with respect to a given factorization is infinite. In order to distinguish 
between these, we turn our attention towards the coefficients in the decomposition. Since these coefficients define 
a probability distribution, different decompositions can be ordered with respect to the uniformity of the associated 
distributions. This uniformity can be quantified by several 'entropic' quantities. The most obvious candidate is the 
Shannon entropy, defined for a probability distribution p = (jpi,p2, ■■■,Pm) as 



H(p) = -Y,Pklogp k . (22) 

k=l 

Thus, we can associate with every decomposition D = {(c^, \4>k))Yk=i °f the state vector the entropy 

m 

%>(£) = - J> fc | 2 log |c fe | 2 . (23) 
fc=i 

We refer to this quantity as the IU entropy of the state vector for the decomposition D, since it has previously 
been considered by Ingarden and Urbanik p7[ , albeit in a very different context. 
It is now possible to state our choice of preferred decomposition: 

Given a distinguished factorization F, the preferred decomposition of the dynamical state vector \ip) is 
the one that minimizes the IU entropy of from among all product decompositions with respect to F. 

By choosing the product decomposition that minimizes the IU entropy, we are choosing the interpretation where 
the probability distribution over the possible property state vectors is as narrow as possible. Moreover, since the 
minimum IU entropy (from among IU entropies for product decompositions) is zero if and only if \ip) is a product 
state, it can be thought of as a measure of the entanglement of l^) with respect to the distinguished factorization. 
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The strongest motivation for such a choice of preferred decomposition is that it appears very promising in securing a 
solution to the measurement problem and in satisfying the faithfulness criterion, as will be demonstrated in sections 
5.2 and 5.3. We do not however offer any a priori justification of the principle. 

Implementing the proposal requires solving the minimization problem for a given dynamical state vector and a 
given choice of distinguished factorization. If the distinguished factorization is bi-partite, the solution is given by the 
following theorem. 

Theorem 3 Suppose \ip) is any vector in a Hilbert space with a bi-partite distinguished factorization Fbi. Any 
decomposition of \ip) that is bi-orthogonal with respect to Fbi minimizes the IU entropy from among all product 
decompositions of with respect to -Fbi- 

The proof of this theorem is relegated to appendix A. In the case of an n-partite distinguished factorization, with 
n > 2, we have not yet found a solution to the minimization problem for all state vectors. However, the bi-partite 
result can be used to identify the preferred decomposition for some state vectors, as follows. 

Theorem 4 Suppose \ip) is a vector in a Hilbert space with an n-partite distinguished factorization, F n , where 
n > 2. If there exists a decomposition of \ip) that is a product decomposition with respect to F n and that is 
a bi-orthogonal decomposition with respect to some bi-partite coarse-graining of F n , then this decomposition 
minimizes the IU entropy from among all product decompositions with respect to F n . 

Proof. Suppose Ft,; is a bi-partite coarse-graining of F n . The set S n of decompositions of that are product 
decompositions with respect to F n is a subset of the set Shi that are product decompositions with respect to Ft,;. 
We can denote this by S n C S bi . Moreover, suppose Z?™ m (Z? b I j ln ) is the decomposition that minimizes the IU entropy 
from among all the elements of S n (Shi)- Theorem 3 shows that for every state vector \tj}), D^ ln is the bi-orthogonal 
decomposition of \tp). For certain state vectors, it may happen that F)^ 1 " lies among the elements of S n . Since we 
know that D™ n minimizes the IU entropy from among all the elements of Sbi, and S n C Sbi, it follows that in this 
case D™™ also minimizes the IU entropy from among all the elements of S n . Thus, in this case F)™ m = F)^ 1 ". QED. 

Theorem 4 is not a complete solution to the minimization problem because there exist state vectors for which F)JJ] ln 
does not lie among the elements of S n . Further work is required to determine the decomposition that minimizes the 
IU entropy in such cases. 

We note that in the proof of theorem 3, presented in appendix A, the only relevant feature of the IU entropy is 
that it has the form Y^k=i /(l c fe| 2 ) f° r some concave function /. It follows that one would obtain the same results if, 
instead of minimizing the IU entropy, one minimized any other entropic quantity having this form. However, there is 
no guarantee that this insensitivity to the choice of entropic quantity persists in the more general case of state vectors 
for which theorem 4 does not apply. 

A possible difficulty with the minimal entropy proposal as it stands has to do with the uniqueness of the preferred 
decomposition. It is well known that the bi-orthogonal decomposition of a state vector is not unique when the 
eigenvalues of the reduced density operator for one of the subsystems are degenerate. It follows from theorem 3 that if 
the distinguished factorization is bi-partite, then the decomposition that minimizes the IU entropy may not be unique, 
and the minimal entropy proposal may fail to uniquely specify a preferred decomposition. For instance, this occurs 
if the dynamical state vector is the EPR-Bell state for two spins \ip) = 2~ 1 / 2 (|+a) |— a) + |— a) |+a)). This difficulty 
persists in the case of an n-partite distinguished factorization, F n , where n > 2, since there are dynamical state 
vectors for which theorem 4 applies and the decomposition that minimizes the IU entropy is non-unique; an example 
being a tensor product of EPR-Bell states. It should be noted however that a degeneracy among the eigenvalues 
of the reduced density operator for one of the factor spaces of F n does not always lead to a non-unique preferred 
decomposition. For instance, if the dynamical state vector has a decomposition that is n-orthogonal with respect to 
the factorization F n , then it follows from theorem 4 that this decomposition minimizes the IU entropy, and since the 
n-orthogonal decomposition is unique for n > 2 pq ], so is the preferred decomposition. It is an open question whether 
the minimization of the IU entropy leads to a unique preferred decomposition when the dynamical state vector is such 
that theorem 4 does not apply. 

It is useful to distinguish two cases of non-uniqueness of the preferred decomposition: an instantaneous non- 
uniqueness, occurring at an isolated moment in time, and an extended non-uniqueness, occurring over a finite interval 
of time. If the constraint of analyticity (defined in Eq.(|l3|)) holds for the minimal entropy proposal, then the instanta- 
neous non-uniqueness problem can be solved easily: the preferred paths at the moment of non-uniqueness are simply 
taken to be the limit of the preferred paths at adjoining times. This is the same solution as was proposed in the 
context of the atomic modal interpretation by Bacciagaluppi and Dickson . The extended non-uniqueness problem 
is not so easily solved. One possible approach to the problem is to argue that cases wherein there is an extended 
non-uniqueness have negligible probability. Since such an argument has been made for the occurrence of a non-unique 
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bi-orthogonal decomposition by Bacciagaluppi, Donald and Vermaas [g9[, this result can be applied to the minimal 
entropy proposal in cases where theorem 4 applies. 

Finally, we turn to the issue of dynamics. Given theorem 4, it is possible to show that the minimal entropy proposal 
satisfies the constraint of analyticity in some cases. In particular, if the dynamical state vector evolves in such a way 
that it has a bi-orthogonal decomposition with respect to some coarse-graining of the distinguished factorization for 
a finite interval of time, then Eq.([l3|) can be satisfied for that interval. The reason is that the vector elements of 
a bi-orthogonal decomposition are analytic functions of time, as has been shown by Bacciagaluppi and Dickson [J7j. 
It remains an open question whether for arbitrary dynamical state vectors the decomposition that minimizes the 
IU entropy, considered as a function of time, satisfies the analyticity constraint. If this is indeed the case, then the 
entropy minimization rule defines a set of preferred paths. 

Given such a set of paths, denoted by \\4>k{t))}t--\ , we must choose the form of the dynamics from among all 
possible solutions for Jkj(t) and Tkj(t) in Eqs. (|l7|), ( |l8a| ) and ([lj]). We follow Bacciagaluppi and Dickson in 
choosing: 

J kj (t) = 21m (Mt)\H - H(t)\4>j(t)) <&(t)M*)>] , (24) 

and 

r fci (t)= m ax{0,^}. (25) 
Pj\ l ) 

This is a generalization to time-dependent preferred decompositions of the choice made by Bell 0] , Vink p3] and Bub 
H . Since the inequality in Eq. (^) is saturated, this choice of Tkj (t) minimizes the degree of stochasticity for a given 
form of the current. Such a choice is motivated by the fact that classical mechanics, which is deterministic, must be 
obtained as a limit of quantum mechanics. 



B. The quantum measurement problem 



We now consider whether the minimal entropy proposal solves the quantum measurement problem. Although this 
term is often taken to refer to the whole cluster of conceptual difficulties surrounding measurement, we shall use it to 
refer to the particular problem of deriving operational quantum mechanics from a realist no-collapse interpretation. 
To consider the problem, we must introduce a quantum mechanical model of the measurement procedure, that is, a 
model of the interaction between the degrees of freedom of the system under investigation, the apparatus, and the 
environment. We discuss both single measurements and sequences of measurements. 



1. Single measurements 



Following the notation introduced in section 3.1, we consider the measurement of a Hermitian operator V, belonging 
to a Hilbert space 7i , the eigenvalues of which are non-degenerate and the eigenvectors of which are denoted by 
{|yfc)}fcLi- Assuming the preparation procedure is associated with a state vector J2k°k \ fk) , where ^2 k \ck\ 2 = 1, 
operational quantum mechanics predicts, via the Born rule, that the measurement will have outcome k with probability 
|cfe| 2 - 

We now consider a quantum mechanical model of the measurement process. The system under investigation is 
called the object system and is assumed to be microscopic. This is made to interact with a macroscopic apparatus, 
associated with a Hilbert space Ti. A , which in turn interacts with a macroscopic environment, associated with a Hilbert 
space TL E . Given an initial state vector in Ti. s <8> Tt A 8 TL E , one could in principle determine the evolution of the total 
system using the full microscopic Hamiltonian. 

In practice of course the problem is far too complex to be solved exactly. Nonetheless, there is a set of standard 
toy models of measurement that are commonly used to investigate realist interpretations. These models adopt some 
simplifying assumptions about the initial state and the form of the evolution. Specifically, it is assumed that the 
object system, apparatus and environment are all initially uncorrelated, so that the initial dynamical state vector has 
the form \ipi~) <S> \Aq) (g> \Eq) , a product state with respect to the factorization {H s , H. A , H. E } of the Hilbert space. 
The dynamics is assumed to be such that 

\<pk) ® \Aq) ® \Eo) ^ Wk) ® \A k ) ® \E k ) , (26) 
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where {|^4fc)}fcLi is a set of orthonormal vectors for the apparatus, {{Ek)}™^ is a set of orthonormal vectors for the 
environment, and {\(pk)Yk = i is a set of normalized but possibly non-orthogonal vectors for the object system, and 
where £ i— >' denotes the mapping corresponding to the unitary evolution. 

If the initial state for the object system is ^2 k Ck \tfik)i the final dynamical state vector for the total system, given 
Eq.(p6|) and the assumption that the evolution is linear, is 

m 

fc=i 

We are now in a position to ask whether a given realist no-collapse interpretation falls prey to the quantum 
measurement problem within this model. We begin by illustrating the problem in the traditional manner, specifically, 
in the context of the simplest realist no-collapse interpretation one can imagine: one where the property ascriptions for 
systems are fixed by the orthodox rule, defined in Eq.(^J). Such an interpretation has been called the 'bare theory' by 
Albert p0| . Within the framework of section 4, it corresponds to adopting the trivial decomposition of the dynamical 
state vector as preferred (the trivial decomposition of \il>(t)) is simply {(1, \ip(t)))}). 

Consider first a case where Ck ^ for only a single value of fc, that is, where the initial state vector of the object 
system is an eigenstate of V. The final state vector is then of the form \(pk) <8> \Ak) <8> \Ek)- By the orthodox rule and 
the reductionist rule, the preferred projector for the property ascription to the apparatus is PA k ■ If the bare theory 
is to reproduce the predictions of operational quantum mechanics in this case, then the property associated with the 
projector P/^ k must be such that the apparatus can be accurately described as 'indicating outcome fc' (for instance, 
if the apparatus indicates the outcome by a digital display, Pa h could correspond to the property of displaying the 
number fc). We refer to this as the assumption of ontological correspondence. 

If, on the other hand, the initial state is such that ^ for more than one value of fc, then the final state vector 
is of the form J\ Ck \<pk) ® l^-fe) ® \Ek) , where Y^'k indicates a sum over values of fc for which Ck ^ 0. In this case, 

the preferred projector for the property ascription to the apparatus is J2 k PA k , while no projector of the form PA k 
receives the value 1. Thus, even given the assumption of ontological correspondence, the bare theory does not predict 
that the apparatus indicates the outcome fc for any value of fc for which Ck ^ 0. Hence the bare theory does not 
reproduce the predictions of operational quantum mechanics. This is the quantum measurement problem. 

We now specify the assumptions under which the minimal entropy proposal solves this problem. These involve the 
nature of the distinguished factorization, which we have not yet specified. Whatever it might be, the distinguished 
factorization should be defined in terms of primitives of the theory and selected by physical principles, for instance, 
from considerations of symmetry. We do not here present an argument for the identity of the distinguished factor- 
ization, however a discussion of the issue can be found in Dicks J26| , wherein it is argued that a necessary condition 
on this choice is that the factor spaces carry an irreducible representation of the space-time group (the Galilei group 
in nonrelativistic quantum mechanics). For the present, we insist only that the distinguished factorization, which we 
denote by F, has the factorization {H s , TC A ,TC E } as a coarse-graining, and that its elements correspond to microscopic 
degrees of freedom (for instance, they could correspond to degrees of freedom of elementary particles). 

Now, suppose that F is such that all the vectors in the sets {\<fk)}kLu {|^fc)}feLu an d ar e product states 

with respect to it. One can then determine that the preferred decomposition of \ip& na ,i) is 

Ainal = {(Cfe, \<fi k ) <8> \A k ) <8> |£fc))Kll ■ 

This follows from theorem 4 and the fact that -Dfi na i is a product decomposition with respect to F that is bi-orthogonal 
with respect to the coarse-graining Fbi = {H s ® H A ,H E } of F. It then follows from the Born rule constraint that, 
with probability |cfe| 2 , the property state vector is 

|$fi„al) = Wk) <8> \A k ) ® \E k ) . 

Using the reductionist rule we find that the projector P^ k is determinate and receives the value 1 with probability 
I Cfe | 2 ■ Finally, by the assumption of ontological correspondence, the apparatus has the property of indicating outcome 
fc with probability \ck\ 2 ■ This is in agreement with the predictions of operational quantum mechanics. 

Thus, we have obtained a solution to the measurement problem within the standard model of measurement. In so 
doing, we have had to assume that when the initial state vector of the object system is an eigenstate of V, the final 
dynamical state vector for the total system is unentangled with respect to the distinguished factorization. 

The assumption of no entanglement between the distinguished factor spaces of the apparatus and the environment 
is not particularly realistic, given that these factor spaces are taken to correspond to microscopic degrees of freedom, 
and typical interactions between the apparatus and the environment are likely to entangle these degrees of freedom. 
However, this assumption can be relaxed somewhat without changing any of our conclusions, as we now demonstrate. 
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We consider a model of measurement wherein the evolution is of the form: 

M 

\<p k ) ® \Aq) ® \E ) ^ \<p k ) <g> ^ f k ^ (\A ktfl ) ® |f? fci/t » . (28) 

where |/fc, M | 2 = 1 and where {{|j4fe,/ 1 )}jJLi}fcL 1 and {{|^fc,^)}*Li}fcLi are orthonormal sets of vectors that are 
product states with respect to F. The assumption of ontological correspondence in this case becomes the assumption 
that for every value of fi, the projector P^ k corresponds to the apparatus indicating outcome k. That there can 
be more than one projector corresponding to indicating a particular outcome is not unreasonable since there can be 
many different microscopic configurations of the apparatus leading to the same overall macroscopic appearance. 

An arbitrary initial state vector for the object system, c k \ fk) > leads, via Eq. (|28|) , to the following final dynamical 
state vector for the total system 

m M 
fc=l ft=l 

The preferred decomposition of this state vector is 

since this is a product decomposition with respect to F that is bi-orthogonal with respect to F bi . It follows that the 
property state vector is 

l*final> = Wk) ® \A k>ll ) ® \E k4l ) 

with probability \c k fk,^ ■ By the reductionist rule, the projector P& k is determinate and receives value 1 with 
probability \ckfk,^\ 2 ■ Finally, by the assumption of ontological correspondence, the apparatus has the property of 
indicating outcome k with probability \c k f kilJr \ — \ck\ 2 , in agreement with operational quantum mechanics. 

Note that the model of measurement provided by Eq.(|2§|) can also describe error-prone measurements. This occurs 
if for some values of (J,, PA k „ corresponds to the property of indicating an outcome k' ^ k, or to the property of 
indicating a malfunction. Furthermore, this model can incorporate measurements described by positive operator- 
valued measures(POVMs) p^j. This follows from the fact that such measurements are implemented by adjoining 
an ancilla to the system under investigation and measuring a projector- valued measure(PVM) on the composite. By 
including the ancilla in our definition of the object system, the model presented above can describe these measurements. 
Note however that we are restricted to PVMs whose eigenvectors are product states with respect to the distinguished 
factorization. 

Despite the possibility of incorporating some error-prone and POVM measurements, the model of measurement 
provided by Eq.j2"s|) is still not the most general or realistic. Although it is true that an arbitrary state vector has 
many decompositions into product states with respect to the distinguished factorization, it is not necessarily the case 
that any of these decompositions are bi-orthogonal with respect to a coarse-graining of the distinguished factorization. 
For instance, if any of the vectors in the set {|¥>fe)}fc=i are entangled with respect to the distinguished factor spaces 
of H s , then theorem 4 fails to apply if the final dynamical state vector is of the form of hAfinai)- Since the problem 
of minimizing the IU entropy for arbitrary state vectors has not yet been solved, it is not clear what the preferred 
decomposition will be in this case and whether the measurement problem is resolved or not. 

It is nonetheless interesting to consider one particular type of modification of the evolution where the only change 
from the model considered above is that the set of vectors {{l^fc^j^LilfcLi (describing the states of the environment 
that are relative to the apparatus states {{|^4fe.^)}*Li}feLi) is only approximately orthogonal. This is an instance 
where theorem 4 may fail to apply. However, the difference between IV'finai) wnen the elements of {{|-Efc,/i}}^Li}fcLi 
are orthogonal and when they are very nearly orthogonal, is not significant. Thus, if the preferred decomposition does 
not depend sensitively on small variations in the dynamical state vector, the preferred decomposition in the nearly 
orthogonal case should be 'close to' Z?^ nal , and it is then likely that the apparatus will be assigned an ontology that 
is 'close to' the one it receives for the orthogonal case. We see therefore that whether or not there is a measurement 
problem in this case depends on whether or not there is such sensitive dependence. The answer to this question must 
await further progress on the problem of the minimization of the IU entropy.^] 



3 The analagous question in the Vermaas-Dieks version of the modal interpretation is whether the spectral resolution of a 
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Finally, we note that the assumption that the apparatus and environment are initially unentangled is also an 
unrealistic feature of the standard model of measurement. For that matter, the assumption that the composite of 
system, apparatus and environment is unentangled with the rest of the universe may not be realistic either. However, 
this difficulty is not unique to the minimal entropy proposal. Every realist no-collapse interpretation must contend 
with the fact that the dynamical state vector for the universe is in general not factorizable with respect to subsystems 
that have interacted in the past, even if this interaction is quite weak. Further work is required to determine whether 
the predictions of the minimal entropy proposal remain satisfactory when these assumptions are relaxed. 

2. Sequences of measurements 

We now demonstrate the extent to which the minimal entropy proposal is in agreement with operational quantum 
mechanics for sequences of measurements. Consider in particular the sequence of two measurements described in 
section 3.1. Recall that the first measurement is of a variable V with eigenstates {\<Pk)}™=i, the second measurement 
is of a variable V' with eigenstates {1^4)}^!, and the state prepared by the first apparatus given outcome k is denoted 
by \(fk) ■ 

In the last subsection we considered two distinct models of measurement which differed in the extent to which the 
apparatus and the environment became entangled due to their interaction. In this subsection, we consider only the 
simpler of the two models. The reader can verify that the more realistic model leads to the same conclusions. 

There are now two apparatuses, and an environment for each. We denote their Hilbert spaces by TC A1 ,TC A2 ,7i E1 , 
and Ti E2 respectively, and we distinguish state vectors for the two apparatuses(environments) by a superscript. It is 
again assumed that the object system, the two apparatuses and the two environments are all initially uncorrelated. 
The distinguished factorization F is assumed to have {H s ,H A \H A \H E1 , H E2 } as a coarse-grammg. 

We assume that the first measurement is well described by Eq.(f26|) with the exception of a change of notation: 
^4o) , \Eq) , \Ak) and \Ek) become , J5q) , \A\) and \E\) in order to specify that the object system interacts 
with the first rather than the second apparatus. We assume that the second apparatus and its environment remain 
uncorrelated with the rest of the system and each other during this first measurement. It follows that the dynamical 
state vector for the total system after the first measurement is 

IVfmal l) = Ck ® \ A D ® \ E l^j ® 1^2) ® \ E o) ■ (29) 

Suppose that |Aq) and \E 2 } are product states with respect to F. If we make all the same assumptions about 
\A\} and \E\^ as were made for \Aj~) and \Ek) in the previous subsection, and if we use the bi-partite factorization 
{H s ® U A1 ®7i A2 , H E1 ®7i E2 } in place of the bi-partite factorization {H s ® H A , TL E } in the arguments found therein, 
then it is straightforward to show that the preferred decomposition of IV'fmai l) is 

Ai„ai i = {(c k , \0k) ® |4) ® \El) ® \A 2 ) ® \E 2 ))}™ =1 . (30) 

We conclude that with probability |cfc| 2 , the first apparatus indicates outcome k, while the second apparatus remains 
ready to measure. 

Now assume that the second measurement is also well described by Eq. (|26|) with the notational change that 
\A ) , \Eq) , \Ak) and \Ek) become A§) , \E 2 ) > \A 2 ) and |£^) since the object system is now interacting with the 
second apparatus, and where the vectors for the object system acquire a prime since the second measurement is of V 
rather than V. For simplicity, we take this second measurement to be non-disturbing, so that \(p' k ) = \<p' k ) . Assume 
also that the first apparatus and its environment have no interactions during this measurement. It then follows that 
the dynamical state vector after the second measurement is 

m rci 

Ivan* 2> = EE Cfe 4 Wi) ® \ A D ® l^fc) ® K 2 ) ® \ E *) . ( 31 ) 
k=i j=i 



density operator is sensitive to small changes in the density operator. Bacciagaluppi, Donald and Vermaas J29[ have shown 
that this does in fact occur when the density operator has nearly degenerate eigenvalues. 
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where the coefficients {dj}^ =1 are denned by \(pk) = Y^k=i $ \<Pj) ■ Again, if we make all the same assumptions about 
and \E 2 ) as were made for \Aj) and \Ej) in the previous subsection, and if we use the bi-partite factorization 
{H s ® H A1 (g> H A2 ,H E1 (g> H E2 } in place of the bi-partite factorization {TL S <g> H A ,H E } in all the arguments found 
therein, the preferred decomposition of |^g na i 2) is found to be 

Ai„al2 = {{(<%<#, |^.) ® |4) «8> \E\) ® |4) ® l^ 2 ))}^}^ ■ ( 32 ) 

We can therefore conclude that there is a probability | Cfc | 2 that the first apparatus indicates outcome k and the 
second apparatus indicates outcome j after the second measurement. It follows that the probability of the second 
apparatus indicating outcome j given that the first apparatus indicates outcome k after the second measurement is 

K fe l 2 = i<¥*>i 2 - 

However, we have still not determined the probability for the second apparatus to indicate outcome j after the second 
measurement given that the first apparatus indicates outcome k after the first measurement, which is the quantity 
specified by the generalized Born rule. The problem is that it has not been shown that the outcome indicated by 
the first apparatus is stable over time. Whether it is or not depends on the dynamics of the property state vector, 
which is determined by Eqs. ( |24|) and (pq) . Now although it may be reasonable to assume that the dynamical state 
vector after a measurement is such that theorem 4 applies, it is unlikely that this theorem applies during the entire 
interaction leading up to this outcome. Given this, we cannot at present determine the time-sequence of preferred 
decompositions nor the preferred paths through Hilbert space defined by this sequence. Since Eqs. ((2J) and (|25|) 
depend on the identity of these preferred paths, we cannot at present determine the dynamics of the property state 
vector. 

Thus, for the moment we simply assume that within the minimal entropy proposal, the apparatus is never described 
as 'jumping' between macroscopically different readings. We call this assumption stability. Given stability, the minimal 
entropy proposal reproduces the predictions of the generalized Born rule. 



C. The Faithfulness criterion revisited 



We now reconsider the experiment of section 3.1 in the context of the minimal entropy proposal. Since this 
experiment involves a sequence of two measurements, we can make use of the model of measurement presented in 
the previous section. Consider the property ascription at the time t, after the first measurement. The dynamical 
state vector is |V"final 1)1 defined in Eq.(|2|), and its preferred decomposition is £)g na j 1, defined in Eq.(|30|). If the first 
apparatus indicates the outcome k at time t, the property state vector must be the fcth element of -Dfinai 1, that is, 
^finai 1) = \<fk) ® 14) ® Pfc)® |^o)® |-^o) ■ The critical feature of the experiment of section 3.1 is that the vector \(pk) 
that is prepared when the first apparatus indicates outcome k, is an eigenstate of V(fe), the variable measured by the 
second apparatus. It follows that the variable ® / (where / is the identity operator for Ti. 
is determinate and has value vi k \ x at time t. Finally, it follows from the reductionist rule that V< 



(fc) 



Ti A2 ®TL E1 ®H E2 ) 
is determinate and 



has value v^k),i a t time t. This is precisely what is required in order for the faithfulness criterion to be satisfied. 

It should be noted that since the variable measured by the second apparatus depends upon the outcome of the 
first measurement, the initial state of the second apparatus may well be different for different outcomes of the first 
measurement. Thus, rather than the first measurement interaction being described by Eq.(j2^), it may be described 
by 



\fh) ® ® \El) ® \A 2 ) ® \E 2 ) 
Wk)®\A\)®\El) 



A 2 



E, 



(fe),o 



(33) 



where { 



A 2 



}T=i and { 



p2 



IfcLi are orthonormal sets of vectors, and 



A 2 



corresponds to the apparatus 
and 



A 2 



IP? 

^(k),0 



being ready to measure the variable V^y In any event, by making the same assumptions for 

were made for \Aq} and \E 2 ^ in the last subsection, one can show that the minimal entropy proposal is in agreement 
with the predictions of operational quantum mechanics even when the nature of the second measurement depends on 
the outcome of the first. 

We end this section with a discussion of the case wherein the second measurement is of a variable whose eigenstates 
are not all product states with respect to the distinguished factorization F. For such measurements, the faithfulness 
criterion cannot be satisfied within the minimal entropy proposal. The reason is as follows. Suppose \ip) is an 
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eigenstate of the measured variable that is entangled with respect to F. If \ip) is prepared by the first measurement 
and measured by the second, then the faithfulness criterion requires that the projector P v be determinate with value 
1 immediately prior to the second measurement. However, for this to occur the property state vector must be an 
eigenstate of P Vl and hence must be entangled with respect to F. But the property state vector is always a product 
state with respect to F in the minimal entropy proposal. 

The failure of the faithfulness criterion for such measurements in the context of the atomic modal interpretation 
of Bacciagaluppi and Dickson Q| and Dieks [^6) has been discussed by Dieks, and also by Vermaas |52|. These 
authors have suggested that an explanation of the outcomes of these measurements might be provided by dispositional 
properties or collective effects of the composite. This explanation can also be invoked in the context of the minimal 
entropy proposal. 

VI. CONCLUSIONS 

In modal interpretations, the properties of a system are given by a specification of the set of determinate variables 
(the ontology) and the value ascription to these variables, jointly referred to as the property ascription. Such inter- 
pretations also assume that the property ascription which obtains at a given time is just one of several possibilities. 
There is always a unique 'smallest' projector which receives the value 1 in each of these possible property ascriptions, 
which we call the preferred projector for that property ascription. 

We have shown that these preferred projectors must be non-orthogonal if one seeks to satisfy the faithfulness 
criterion, that is, if one seeks to explain the outcomes of certain perfectly predictable measurements in terms of 
pre-existing properties of the system under investigation. The possibility of such an explanation has historically been 
a strong motivation for the modal approach. 

We have also shown that non-orthogonal preferred projectors are inconsistent with the assumption, common among 
previous modal interpretations, that at a given time there is only a single possible ontology. In order to consider non- 
orthogonal preferred projectors, we have developped a framework for modal interpretations wherein at a given time, 
the possible property ascriptions may differ with respect to ontology. As is required for any modal interpretation, the 
state vector appearing in the Schrodinger equation, which we call the dynamical state vector, does not uniquely fix 
the property ascription. Rather, a preferred decomposition of the dynamical state vector into a sum of orthogonal 
vectors must be specified at every time, and a single element of this decomposition, dubbed the property state vector, 
fixes the property ascription. The property state vector evolves stochastically according to a Markovian dynamics. 
Finally, subsystems receive only those properties they inherit from the total system by the reductionist rule. 

It is of course possible to generalize this framework in many ways. One could consider non-Markovian dynam- 
ics, alternatives to the reductionist rule, and even non-orthogonal decompositions of the dynamical state vector. 
Nonetheless, we feel that the framework presented is a natural starting place for the interpretive program at hand. 

Within the context of this framework, we have presented a novel proposal for the preferred decomposition. The 
proposal assumes that there is a distinguished set of subsystems of the universe, that is, a distinguished factorization 
of the total Hilbert space into a tensor product of Hilbert spaces^. It is also assumed that the preferred decomposition 
is a product decomposition with respect to this factorization. In the case of a distinguished factorization that is 
bi-partite, it is then natural to follow previous authors in identifying the bi-orthogonal decomposition as preferred. 
However, the obvious generalization of the bi-orthogonal decomposition to an n-partite distinguished factorization, 
namely the n-orthogonal decomposition, does not exist for all state vectors, as shown by Peres [^8). The preferred 
decomposition in our proposal is the one that minimizes the IU entropy from among all product decompositions 
with respect to the distinguished factorization. This decomposition always exists and turns out to be equal to the 
n-orthogonal decomposition when the latter exists. It therefore can be thought of as a natural generalization of the 
bi-orthogonal decomposition to n-partite systems. 

At present the strongest justification for the minimal entropy proposal is its success in dealing with the quantum 
measurement problem and in satisfying the faithfulness criterion. The measurement problem is resolved for a wide 
variety of measurements including certain types of non-ideal measurements, in particular, disturbing measurements, 
assuming particular microscopic models of the apparatus and environment. Within the same microscopic models, the 
faithfulness criterion is satisfied for sequences of disturbing measurements. It is this feature of the minimal entropy 
proposal that sets it apart from previous modal interpretations. 



4 Other modal interpreters M |33| |7| have been led to this assumption by considerations of the correlations between the 
properties of a system and its subsystems. 
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The solution of the measurement problem relies on the assumption of ontological correspondence, that the ontology 
of macroscopic systems corresponds to our everyday perceptions of them, and the assumption of stability, that the 
dynamics of the properties assigned to macroscopic systems are consistent with our stable perceptions of them. Ideally, 
these features would be demonstrated rather than assumed. However, the demonstration of ontological correspondence 
is likely to require a better specification of the distinguished factorization than has been provided in the present work, 
while the demonstration of stability must await progress in solving the entropy minimization problem in cases where 
theorem 4 does not apply. Progress on the minimization problem will also help to determine whether one can solve the 
quantum measurement problem for more general types of measurements than the ones considered here, for example, 
measurements of variables whose eigenstates are entangled with respect to the distinguished factorization. In addition, 
such progress is required to determine what the proposal has to say about more realistic models of measurements. 
Finally, it may indicate whether the IU entropy is the correct quantity to minimize in the rule for determining the 
preferred decomposition, or whether some other entropic quantity might be a better choice. 

So we see that there remain many unanswered questions. In addition to these, there are difficulties with the minimal 
entropy proposal. For one, the product decomposition that minimizes the IU entropy may fail to be unique for certain 
dynamical state vectors. It may be that further technical work will show that this is not a problem after all. For 
instance, dynamical state vectors for which the preferred decomposition is non-unique for a finite interval of time may 
constitute a set of measure zero. Another difficulty is that the faithfulness criterion explicitly fails to be satisfied in 
measurements of variables whose eigenstates are entangled with respect to the distinguished factorization. Given that 
not all Hermitian operators necessarily correspond to variables that can be measured p4| |35f| , it may happen that 
with a suitable choice of distinguished factorization, the measurements for which the faithfulness criterion fails to be 
satisfied are precisely those which are impossible to implement. On the other hand, it may be that this problem cannot 
be avoided within the minimal entropy proposal, but can be avoided if some other choice of preferred decomposition 
is made. As a third possibility, one might find that the faithfulness criterion for variables with entangled eigenstates, 
cannot be satisfied by any interpretation within the framework we have set out. Justifying any one of these answers 
would certainly be an interesting result, and motivates further investigation of these issues. 

The use of a preferred decomposition, sometimes called an 'interpretation basis', has been viewed by some as 
necessary within interpretive strategies distinct from modal interpretations. This has been suggested by Deutsch 
p6| in the context of the many-worlds interpretation and by Kent and McElwaine |?7| in the context of consistent 
histories. A preferred decomposition might also be useful in nonlinear modifications of quantum mechanics. Thus, the 
preferred decomposition of the minimal entropy proposal may well be of relevance to such interpretive strategies as 
well. In any event, a mathematically precise proposal, even though not without problems, can be useful in stimulating 
progress on interpretive issues, as is evidenced by the recent profusion of work on modal interpretations. We hope 
that the minimal entropy proposal will not be an exception in this respect. 
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VIII. APPENDIX: PROOF OF THEOREM 3. 



It will be assumed throughout that the distinguished factorization is bi-partite, and the two factor spaces are 
denoted by Tt A and TL B . All references to product decompositions are to be understood as product decompositions 
with respect to this factorization. We say that a product decomposition {(ck, \xt) ® l&k * s A-orthogonal (B- 

orthogonal) if the set of vectors {|Xfe)}feLi {{\4>k )}T=i)^ s orthogonal. A bi-orthogonal decomposition (also called a 
Schmidt decomposition) is one that is both A-orthogonal and B-orthogonal. We shall make use of several well-known 
properties of bi-orthogonal decompositions, an exposition of which can be found in Ref. [ p8| . Finally, we remind the 
reader that S^\(D) denotes the IU entropy of l^) for the decomposition D, which is denned by Eq.(23). 

Theorem 3 follows from two lemmas: 

Lemma A.l For any vector , if D is an arbitrary product decomposition of \tp) , then there always exists an 
yl-orthogonal decomposition of \ip) , I?A-orth, such that 



Lemma A. 2 For any vector \tp) , if -D^-d-th is any A-orthogonal decomposition of , and -Dbi-orth is any bi- 
orthogonal decomposition of l^) , then 
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Together these imply that for any vector \ip) , if D is an arbitrary product decomposition of \ip), and -Dbi-orth is any 
bi-orthogonal decomposition of , then 

<S|i/>)(A>i-orth) < S\^)(D), 

which is simply theorem 3. 

The task at hand is therefore to prove lemmas A.l and A. 2. We begin by reviewing a partial ordering relation 
among probability distributions, namely that of majorization [[59|, which has recently seen application in the study 
of entanglement purification fief] . Suppose p = (pi,p2, ...,p m ) and q = (qi, q2, q m ) are two m-element probability 
distributions. By definition, p majorizes q if for every I in the range {1, .., m}, 

I>i>E<i 

fc=i fe=i 

where p\ indicates the fcth largest element of p, so that p[ > p\ > ... > p\ n . 

The notion of majorization is important in the present investigation because of the following well-known result 
(Theorem II. 3.1 of Ref. p^): The following two conditions are equivalent 

(i) p majorizes q. 

m rn 

(ii) f(pk) < E f{qk) for all concave functions /. 

k=l k=l 

Since —xlogx is a concave function of x, it follows that H(p) <H(q) if and only if p majorizes q, where H(p) is 
the Shannon entropy of a probability distribution p, defined in Eq.(p3). Now consider two decompositions of a state 
vector, D = {(cfe, |^fe)}fcLi and D' = {(c' k , |^)}? 1 . Although these may have different cardinalities, they can be 
associated with probability distributions of equal cardinality by simply adding zeroes. Specifically, if m > m' , then 
D is associated with the distribution pk = \ck\ for k G {1, ...,m} and D' is associated with the distribution qk = 
\c' k \ for k E {1, ...,m'} and qk = for k G {m' + 1, ...,m}. Since the IU entropy of for the decomposition D(D') 
is simply the Shannon entropy of p(q), it follows that S^) (D) < S 1 ^) (£)') if p majorizes q. 
In order to facilitate the proof of lemma A.l, we set out two minor lemmas. 

Lemma A. 3 Consider two probability distributions p = (pi,f>2j ■■■,Pm) and q = (qi, q2, q m ). If for every I in the 
range {1, m}, 

i i 

k=l k=l 

then p majorizes q. 

Proof. This result follows from the definition of majorization and the fact that 

i i 

fe=i fc=i 

for every / in the range {1, ...,m}. This inequality is obviously true since the /-element subset of p with the largest 
sum must be the subset containing the / largest elements of p. QED. 

For the second minor lemma, we make use of some notational conventions introduced in the text: Ps denotes the 
projector onto the subspace 5, and l Ps < Ps 1 ' denotes that S is a proper subspace of S' . 

Lemma A.4 If P s < P s , then (tp\ P s \ip) < (^| P s < \ip) ■ 

Proof. If Ps = Ps' 7 then the inequality is saturated. Otherwise, Ps < Ps' , and there exists a projector P<r such 
that Ps + Pt = Ps 1 - The desired inequality follows from the positivity of {ip\ Pt ■ QED. 
We are now in a position to prove lemma A.l. 

Proof of lemma A.l. An arbitrary product decomposition has the form D = {(dk, \<f>k) ® X^))}fcLi! where the 
lists of vectors {|<^fc and {\xk )}T=i are not necessarily orthogonal nor even linearly independent (although the 
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list of vectors {\(f> A ) ® |xf )}kLi * s orthogonal). The decomposition P defines an m-element probability distribution 



Q = {<7i, 92, q m }, where qk = \dk\ 2 . As before, let q\, denote the fcth largest element of q, and let 
denote the vectors associated with q\, . 



and 



xi A 



Now, identify every vector in the list { 4>i /IfeLi that cannot be obtained as a linear combination of vectors with 
lower indices from this list. Suppose there a number m! of such vectors, corresponding to a particular subset S of the 
indices {1,2, ...,to}, so that the set of vectors is denoted by { fij^^jkes- By definition, this is a linearly independent 

set. The remaining vectors are denoted by { (pl^^keS? where S is the set of indices that remain after removing the 

elements of S from {1,2, ...,to}. Obviously the elements of { 4>\^^}keS can au be written as linear combinations of 

the elements of { 4>\f^}k'es, k'<k- Finally, for future reference, we define g(k) as the number of indices k' in S such 
that k' < k. It is clear from the definition of 5* that g{k) < k. 

Let {\fJ,f)}'j^ 1 be the ordered set of orthogonal vectors that are obtained by applying the Gram-Schmidt orthogo- 



nalization procedure to { 



}kes, m order of ascending k. This new set yields an A-orthogonal decomposition of 



\ip) , -DA-orth = {(cj, \^f) ® |z^))}jli, where \vf) — (fif\"4>) /cj and Cj — \^ A \ip)\ . It also defines an m-element 

probability distribution p = (pi,p2, ...,p m ) where pj = \cj\ 2 for j in the range {1, ...,m'}, and pj = for j in the 
range {to' + 1, to}. 

Let P A denote the projector onto the ray spanned by 



i> A ) and for convenience define P A 



fnuii for j in the 



range {to',...,to}. The nature of the Gram-Schmidt orthogonalization procedure ensures that for every k G S, 



' - Y,f=l fj for some set of complex amplitudes {fj} 9 ^. Thus, (£gj> P A ) 



k j i-^i] 

lently, £g> 



bj, A \ , or equiva- 



Pfi > P}£- Since g(k) < k, this is trivially extended to £)* =1 Pfi > Pj£- Moreover, if k £ S, then 



J k 



' \ can be written as a linear combination of the elements of { 



for some set of complex amplitudes {fj} 9 j=i^\ where h(k) — m&Xk>£S,k'<k k' . It follows that Y^Li^^ P^ ^ P^ f° r 
all k E S. Since g(h(k)) < k, this is trivially extended to Y^=i Ppj — P$t' ^ f° uows therefore that for every k in the 
range {1, ...,to} we have £* =1 P A > P^ 4 . Now, since I B > P+f, we can infer that £* =1 P£ ®I B > P^® P+f , and 
by the orthogonality of the projectors in the set {Pj>f <8> P^ 8 }/ 5 —!, we conclude that X^j=i P^ ® ^ 
P^f for every k in the range {1, to}. 

Now we note that the probability distributions q and p are related to the projectors by qf = ( 



}fc'es, k'<k, so that 



4 A 



\^9((h(k)) I I A\ 
2^j=l h |Mj / 



,g((h(k)) 



4>i 



and 



Pj = (ip\ P A <8> I B \tjj) . From the inequality derived above together with lemma A. 4, we find that Y^j=\Vj > S;=i ij 
for every k in the range {1, ...,m}. By lemma A. 3, it follows that p majorizes q. QED. 
Finally, we prove lemma A. 2. 

Proof of lemma A. 2. An arbitrary ^4-orthogonal decomposition of \ip) has the form P^-orth = {{cj, ® 
\ v f ))}JLi> where the vectors {\/j,^)}'jL 1 are orthogonal, but {\vf}}™ =1 need not be orthogonal nor even linearly 
independent. A bi-orthogonal decomposition of \ip) has the form Pbi-orth = {(cj, \fi>f) ® \vf where both the 



vectors {\jx A )}™ =1 and { | ^J 4 ) J^Li form orthogonal sets. The probability distributions associated with each decompo- 

|c m | 2 ) and (|ci| 2 , |c 2 



3 

sition are (|ci| , |c 2 



l~ |2 



7 | 



■ ■,\Cfh\~) respectively (even if there is more than one bi-orthogonal 
decomposition for a particular state vector, these do not differ in their coefficients). For ease of comparison of these 
distributions, we add zeroes until the number of elements in each is equal to the dimensionality, d, of the Hilbert space 
TL A . Denote the resulting distributions by p and p respectively. We establish that (Pbi-orth) < S\^\ (P^-orth) by 
showing that p majorizes p. 

To begin, we express the probabilities as expectation values of projectors. We introduce an arbitrary orthogonal 
set of vectors {|m/ )}j=m+i which together with {|^/)}^Li form an orthogonal basis for the Hilbert space H A , and 
similarly for ^pf)}™^- Then, we have for all j in the range {!,..., d}, 



p 3 =Tr A (p A P A ), and 
p J =Tr A (p A P A ). 



Let the unitary operator that transforms the elements of {\pf)}j =1 to the elements of {|^)}^ =1 be denoted by U A 1 



so that 
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It follows that 



Pj =Tr A (p A U A ^P A U A ) 
= Tr A (U A p A U A ^P A ), 

where in the last step we have used the cyclic property of the trace. What distinguishes the bi-orthogonal decompo- 
sition from other A-orthogonal decompositions is that the projectors {Pn k }k=i diagonalize p A , 



A 

f'k ■ 



k=l 



Plugging this form of p into the expression for pj , we obtain 



P.i 



= EI^I 



Pk, 



k=l 



where U A k = (p 3 \U A \p k ) . By the unitarity of U A , we find that £\ U A k = (/2 fc | U A ^U A \p k ) = 1, and 



U A 
u jk 



U A U A ^ \plj) = 1. Thus, the transition matrix between the probability distributions p and p is doubly stochastic, 
from which it follows by a well-known result (theorem II. 1.9 of Ref. 139]) that p majorizes p. QED. 
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Measurement 1 Measurement 2 




Fig. 1 An example of a sequence of measurements upon a spin 1/2 particle for which the faithfulness criterion can only be 
satisfied if one adopts non-orthogonal preferred projectors. In the diagram, SG(n) denotes a Stern-Gerlach magnet oriented 
along the n axis. Dl and D2 are detectors. A and B denote the two possible locations at which the particle might be detected 
by Dl. The magnetic field after Dl is designed to map the state l-z) onto l+x). The axis n of the second Stern-Gerlach magnet 
is assumed to depend on the the measurement outcome at Dl. If the particle is found at A, then n is set to z, while if it is found 
at B, then n is set to x. Although the steering mechanism for the beams is not indicated, an appropriate mechanism can be 
devised (particularly since it can be taken to depend on the measurement outcome at Dl). 



